Search results for "Computational Linguistics"

showing 10 items of 210 documents

On the empirical spectral distribution for certain models related to sample covariance matrices with different correlations

2021

Given [Formula: see text], we study two classes of large random matrices of the form [Formula: see text] where for every [Formula: see text], [Formula: see text] are iid copies of a random variable [Formula: see text], [Formula: see text], [Formula: see text] are two (not necessarily independent) sets of independent random vectors having different covariance matrices and generating well concentrated bilinear forms. We consider two main asymptotic regimes as [Formula: see text]: a standard one, where [Formula: see text], and a slightly modified one, where [Formula: see text] and [Formula: see text] while [Formula: see text] for some [Formula: see text]. Assuming that vectors [Formula: see t…

Statistics and ProbabilityPhysicsAlgebra and Number TheorySpectral power distributionComputer Science::Information RetrievalProbability (math.PR)Astrophysics::Instrumentation and Methods for AstrophysicsBlock (permutation group theory)Marchenko–Pastur lawComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Bilinear form60F05 60B20 47N30Sample mean and sample covarianceCombinatoricsConvergence of random variablesFOS: Mathematicssample covariance matricesComputer Science::General LiteratureDiscrete Mathematics and CombinatoricsRandom matriceshigh dimensional statisticsStatistics Probability and UncertaintyRandom matrixRandom variableMathematics - ProbabilityRandom Matrices: Theory and Applications
researchProduct

Probabilities to Accept Languages by Quantum Finite Automata

1999

We construct a hierarchy of regular languages such that the current language in the hierarchy can be accepted by 1-way quantum finite automata with a probability smaller than the corresponding probability for the preceding language in the hierarchy. These probabilities converge to 1/2.

Discrete mathematicsTheoretical computer scienceNested wordFinite-state machineHierarchy (mathematics)Computer scienceComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Turing machinesymbols.namesakeNonlinear Sciences::Exactly Solvable and Integrable SystemsRegular languageProbabilistic automatonAnalytical hierarchysymbolsComputer Science::Programming LanguagesQuantum finite automataQuantum algorithmNondeterministic finite automaton
researchProduct

ON-LINE CONSTRUCTION OF A SMALL AUTOMATON FOR A FINITE SET OF WORDS

2012

In this paper we describe a "light" algorithm for the on-line construction of a small automaton recognising a finite set of words. The algorithm runs in linear time. We carried out good experimental results on real dictionaries, on biological sequences and on the sets of suffixes (resp. factors) of a set of words that shows how our automaton is near to the minimal one. For the suffixes of a text, we propose a modified construction that leads to an even smaller automaton. We moreover construct linear algorithms for the insertion and deletion of a word in a finite set, directly from the constructed automaton.

minimal automata[INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS]Timed automatondeterministic automataBüchi automaton0102 computer and information sciences02 engineering and technology01 natural sciencesDeterministic automaton0202 electrical engineering electronic engineering information engineeringComputer Science (miscellaneous)Two-way deterministic finite automatonNondeterministic finite automatonMathematicsonline construction.Discrete mathematicsSettore INF/01 - InformaticaPowerset constructionPushdown automatonComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)010201 computation theory & mathematicsProbabilistic automaton020201 artificial intelligence & image processingFinite set of wordAlgorithmComputer Science::Formal Languages and Automata Theory
researchProduct

Puentes entre la Lingüística computacional y la Psicolingüística

2011

[EN] Cognitive sciences have assumed that there can be relationships between various disciplines such as Philosophy, Linguistics, Anthropology, Artificial Intelligence, or Psychology. This work aims to make explicit these relations between the Psycholinguistics and Computational Linguistics.

Cognitive scienceLinguistics and LanguagePsycholinguisticsLingüísticaComputational linguisticsPsicolingüísticaLanguage and LinguisticsPsycholinguisticsLinguisticslcsh:Philology. LinguisticsInformationSystems_MODELSANDPRINCIPLESlcsh:P1-1091Cognitive sciencesLingüística computacionalCiencias cognitivasComputational linguisticsPsychology
researchProduct

Automata and forbidden words

1998

Abstract Let L ( M ) be the (factorial) language avoiding a given anti-factorial language M . We design an automaton accepting L ( M ) and built from the language M . The construction is effective if M is finite. If M is the set of minimal forbidden words of a single word ν, the automaton turns out to be the factor automaton of ν (the minimal automaton accepting the set of factors of ν). We also give an algorithm that builds the trie of M from the factor automaton of a single word. It yields a nontrivial upper bound on the number of minimal forbidden words of a word.

TheoryofComputation_COMPUTATIONBYABSTRACTDEVICES[INFO.INFO-DS]Computer Science [cs]/Data Structures and Algorithms [cs.DS]Büchi automaton0102 computer and information sciences02 engineering and technologyω-automaton01 natural sciencesTheoretical Computer ScienceCombinatoricsDeterministic automaton0202 electrical engineering electronic engineering information engineeringTwo-way deterministic finite automatonNondeterministic finite automatonMathematicsPowerset constructionLevenshtein automaton020206 networking & telecommunicationsComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Nonlinear Sciences::Cellular Automata and Lattice GasesComputer Science ApplicationsTheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES010201 computation theory & mathematicsSignal ProcessingProbabilistic automatonComputer Science::Programming LanguagesComputer Science::Formal Languages and Automata TheoryInformation Systems
researchProduct

Translingual text mining for identification of language pair phenomena

2016

Translingual Text Mining (TTM) is an innovative technology of natural language processing for building multilingual parallel corpora, processing machine translation, contextual knowledge acquisition, information extraction, query profiling, language modeling, contextual word sensing, creating feature test sets and for variety of other purposes. The Keynote Lecture will discuss opportunities and challenges of this computational technology. In particular, the focus will be made on identification of language pair phenomena and their applications to building holistic language model which is a novel tool for processing machine translation, supporting professional translations, evaluation of tran…

Machine translationLanguage identificationComputer sciencebusiness.industry05 social sciencessimilarity metrics02 engineering and technologycomputer.software_genre050105 experimental psychologycomputational linguisticsmultilingual information retrievalUniversal Networking LanguageCache language modelLanguage technology0202 electrical engineering electronic engineering information engineeringComputer-assisted translation020201 artificial intelligence & image processing0501 psychology and cognitive sciencesinformation extractionLanguage modelArtificial intelligencebusinesscomputerLanguage industryNatural language processing2016 Sixth International Conference on Innovative Computing Technology (INTECH)
researchProduct

Determination of m¯b/m¯c and m¯b from nf=4 lattice QCD+QED

2021

We extend HPQCD's earlier ${n}_{f}=2+1+1$ lattice-QCD analysis of the ratio of $\overline{\mathrm{MS}}$ masses of the $b$ and $c$ quark to include results from finer lattices (down to 0.03 fm) and a new calculation of QED contributions to the mass ratio. We find that ${\overline{m}}_{b}(\ensuremath{\mu})/{\overline{m}}_{c}(\ensuremath{\mu})=4.586(12)$ at renormalization scale $\ensuremath{\mu}=3\text{ }\text{ }\mathrm{GeV}$. This result is nonperturbative. Combining it with HPQCD's recent lattice $\mathrm{QCD}+\mathrm{QED}$ determination of ${\overline{m}}_{c}(3\text{ }\text{ }\mathrm{GeV})$ gives a new value for the $b$-quark mass: ${\overline{m}}_{b}(3\text{ }\text{ }\mathrm{GeV})=4.513(2…

QuarkQuantum chromodynamicsPhysicsParticle physics010308 nuclear & particles physicsComputer Science::Information RetrievalHigh Energy Physics::LatticeHigh Energy Physics::PhenomenologyComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Lattice QCDMass ratio01 natural sciencesRenormalizationLattice (order)0103 physical sciencesHigh Energy Physics::Experiment010306 general physicsPhysical Review D
researchProduct

On P-compatible hybrid identities and hyperidentities

1994

P-compatible identities are built up from terms with a special structure. We investigate a variety defined by a set ofP-compatible hybrid identities and answer the question whether a variety defined by a set ofP-compatible hyperidentities can be solid.

AlgebraMathematical logicSet (abstract data type)Structure (mathematical logic)History and Philosophy of ScienceLogicVariety (universal algebra)Computational linguisticsMathematicsStudia Logica
researchProduct

Editorial: Mining Scientific Papers: NLP-enhanced Bibliometrics

2019

International audience

Computer science[SHS.INFO]Humanities and Social Sciences/Library and information sciencestext miningBibliometrics050905 science studiescomputer.software_genrescientific papersscientometrics[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]Bibliography. Library science. Information resourcescomputational linguistics[SHS.HISPHILSO]Humanities and Social Sciences/History Philosophy and Sociology of Sciencesnatural language processing[SHS.LANGUE]Humanities and Social Sciences/LinguisticsComputingMilieux_MISCELLANEOUScitation content analysisbusiness.industry05 social sciencesScientometrics[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]Artificial intelligence0509 other social sciencesComputational linguistics050904 information & library sciencesbusinesscomputerNatural language processingZ
researchProduct

Languages with mismatches

2007

AbstractIn this paper we study some combinatorial properties of a class of languages that represent sets of words occurring in a text S up to some errors. More precisely, we consider sets of words that occur in a text S with k mismatches in any window of size r. The study of this class of languages mainly focuses both on a parameter, called repetition index, and on the set of the minimal forbidden words of the language of factors of S with errors. The repetition index of a string S is defined as the smallest integer such that all strings of this length occur at most in a unique position of the text S up to errors. We prove that there is a strong relation between the repetition index of S an…

Combinatorics on wordsApproximate string matchingGeneral Computer ScienceRepetition (rhetorical device)String (computer science)Search engine indexingComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Approximate string matchingData structureTheoretical Computer ScienceCombinatoricsSet (abstract data type)Formal languagesCombinatorics on words Formal languages Approximate string matching IndexingIndexingWord (group theory)MathematicsInteger (computer science)Computer Science(all)Theoretical Computer Science
researchProduct